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I.  INTRODUCTION 


The  determination  of  the  most  efficacious  method  of  transmit¬ 
ting  speech  signals  in  narrowband  form  has  been  a  major  focus  of 
speech  researchers  and  communications  engineers  back  as  far  as 
1928  when  Homer  Dudley  introduced  a  device  called  a  "vocoder” 1 . 

The  development  of  efficient  speech  coding  methods  continues  to 
be  an  area  of  critical  concern,  especially  for  the  Air  Force, 
because  the  ability  to  manipulate  speech  signals  over  communica¬ 
tions  transmission  channels  through  efficient  and  versatile 
coding  procedures  will  enhance  air-to-air  and  air-to-ground 
reconnaissance  and  surveillance. 

Significant  advancements  have  been  made  in  speech  coding 
technology  since  Dudley's  prototype  vocoder.  Moreover,  the  narrow- 
band  linear  predictive  coding  (LPC)  technique  has  become  widely  used 
in  both  civilian  and  military  applications^.  However,  in  spite  of 
its  recent  widespread  use,  in  spite  of  the  many  improvements  made  in 
it  and  in  spite  of  its  adoption  as  the  military  standard,  the  LPC 
vocoder  still  has  not  undergone  a  systematic  and  comprehensive 
evaluation  for  general  and  practical  applications. 

The  present  project  was  designed  to  test  an  LPC-10  vocoder's 
performance  during  the  processing  of  speakers  whose  voices  represent 
vocal  extremes  in  respect  to  pitch  and  quality.  The  impetus  for 
this  study  emanated  from  the  following  general  hypotheses  and 
findings  reported  in  recent  signal  and  speech  transmission  liter¬ 
ature: 

1.  A  speaker's  voice  quality  has  a  clear  effect  on  the 
subsequent  LPC  quality3. 

2.  The  efficacy  of  LPC  analysis  has  been  shown  to  be  dependent 
on  the  fundamental  frequency  of  the  voiced  signal4. 

3.  Narrowband  LPC  notoriously  lacks  robustness5. 

4.  -LPC  ha3  difficulties  with  the  voices  of  women  and 
children** . 

5.  The  presence  of  whisper  and  nasality  has  a  negative 
influence  on  intelligibility  for  both  male  and  female  LPC  quality?. 

Although  a  few  studies  have  examined  the  performance  of  LPC 
vocoders  for  design  purposes,  the  "robustness,"  i.e.,  their 


ability  to  process  a  variety  of  voice  types  and  speaking  styles 
during  practical  user  applications  is  yet  to  be  determined. 
Information  pertaining  to  the  robustness  of  LPC  vocoders  is  of 
utmost  importance  to  signal  transmissions  personnel  and  to 
communications  engineers  because  such  information  should  aid  in 
continued  development  and  improvement  of  speaker  independent 
speech  recognition  technology. 


II.  OBJECTIVES 


The  overall  objective  of  the  present  project  was  to 
determine  the  robustness  of  the  LPC-10  vocoder  by  assessing  its 
ability  to  process  voices  with  pitch  and  quality  extremes.  The 
specific  objectives  were: 

1.  To  evaluate  the  ability  of  the  LPC-10  vocoder  to  process 
phonated  and  whispered  speech. 

2.  To  evaluate  the  ability  of  the  LPC-10’  vocoder  to  process 
voices  representing  different  sexes,  i.e.,  voices  with  funda¬ 
mental  frequency  and  formant  frequency  extremes. 

3.  To  determine  the  acoustic  characteristics,  i.e.,  the 
formant  frequencies,  formant  bandwidths  and  formant  amplitudes  of 
male  and  female  phonated  and  whispered  vowels  during  connected 
speech. 

Whispered  and  phonated  voice  characteristics  were  selected 
as  the  focus  of  this  study  because  (1)  they  represent  two 
extremes  of  voice  production  or  laryngeal  activity®,  (2)  one  of 
the  concomitants  of  human  fatigue  is  dysphonia  which  may  range 
from  partial  to  complete  loss  of  voice,  and  (3)  previous  research 
has  indicated  the  formant  frequencies  of  whispered  isolated 
vowels  are  higher  than  those  of  phonated  isolated  vowels9.  Male 
and  female  voices  were  used  because  the  habitual  pitch  for  female 
voices  is  almost  twice  the  habitual  pitch  for  male  voices. 

The  LPC  technique  was  of  particular  concern  to  this  study 
because,  as  pointed  out  earlier,  it  has  been  adopted  as  the 
military  standard.  Purportedly,  this  technique  is  especially 
attractive  to  military  communications  personnel  because  (1)  only 
a  short  segment  of  speech  is  needed  to  yield  accurate  results, 

(2)  it  is  suitable  for  analyzing  high  pitched  voices,  such  as 
women's  and  children's,  and  (3)  data  rate  can  be  reduced  to 
approximately  2400  bits/sec.  without  producing  degradation  in 
speech  quality^. 
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III.  METHOD 


Subjects 

Seven  young  adult  males  and  four  young  adult  females  who 
demonstrated  normal  speech  and  hearing  characteristics  produced 
the  speech  samples  for  this  study.  Their  ages  ranged  from  19-32 
years  and  17-24  years,  respectively.  The  mean  age  for  males  was 
23.6  years  and  the  mean  age  for  females  was  21.3  years.  All  of 
the  subjects  grew  up  in  the  northeastern  region  of  the  United 
States  and  spoke  General  American  English  dialect. 

Speech  Sample 

Each  of  the  11  subjects  was  instructed  to  first  normally 
phonate  a  list  of  five  sentences  and  secondly  to  whisper  the  same 
list.  Embedded  in  each  of  the  five  stimulus  sentences  was  an 
"h_d"  word  in  which  were  embedded  the  experimental  vowels:  /i/, 
/ae/,  /u/,  /a/  and  /  /  (see  Figure  1). 

These  particular  vowels  were  selected  because  they  represent 
the  cardinal  or  articulatory  extremes  for  English  vowel  production 
(see  Figure  3). 

Instrumentation 

The  LPC-10  vocoder  was  the  speech  processing  device  used  in  the 
present  study.  The  LPC-10  is  a  time-domain  device  that  analyzes  and 
synthesizes  speech  using  the  principles  of  linear  predictive  coding. 
Linear  predictive  coding  is  a  speech  modeling  technique  which 
approximates  a  given  speech  signal  as  a  linear  combination  of  past 
samples  of  a  hypothetical  input  to  a  system  whose  output  is  the 
given  speech  signal.  The  predictor  coefficients  which  become  the 
parameters  of  the  digital  analysis  filter  are  determined  by 
minimizing  the  squared  differences  between  the  actual  speech  samples 
and  the  linearly  predicted  ones. 

Since  the  vocoder  analyzes  a  given  speech  sample  before  it 
reconstructs  it  from  the  analysis  data,  it  is  possible  to  remove 
information  redundancy  or  to  compress  a  speech  sample  if  desired. 

It  should  be  noted  that  the  device  used  in  the  present  study 
was  computer  simulation  of  the  LPC-10  vocoder.  For  testing 
purposes,  the  computer  simulated  version  of  the  vocoder  actually 
allowed  more  flexibility. 


Recording  Procedure 

Each  of  the  subject's  speech  samples  was  stored  by  recording 
it  on  a  Uher  Model  4000  Report  IC  reel-to-reel  magnetic  tape 
recorder  while  the  subject  was  seated  in  an  IAC  sound  attenuating 
booth.  Each  subject  was  allowed  to  practice  the  list  of 
sentences  under  the  whispered  and  phonated  conditions  until  he 
felt  comfortable  with  his  productions.  An  Uher  Model  M518 
Dynamic  Microphone  was  positioned  5  inches  in  front  of  each 
subject's  mouth,  slightly  below  the  level  of  the  lips  during  the 
recording  of  the  samples.  Any  samples  that  the  investigator 
judged  as  faulty  during  his  monitoring  of  the  recording  sessions 
were  repeated  by  the  subject  until  an  acceptable  sample  was 
produced. 

Analysis  Procedure 

Prior  to  analyzing  the  speech  samples,  each  of  the  vowel- 
embedded  words  was  isolated  from  the  carrier  sentences  by 
mechanical  hand  splicing.  One  foot  of  leader  tape  was  spliced 
onto  each  side  of  each  of  the  vowel-embedded  words  so  that  the 
vowel  samples  could  be  fed  continuously  into  the  computer  during 
analysis.  During  the  splicing  procedure,  the  phonated  and 
whispered  productions  of  a  given  vowel  were  paired  so  that  they 
could  be  subsequently  analyzed  and  displayed  together. 

Thus  each  of  the  11  subjects  had  a  total  of  5  pairs  of 
vowel-embedded  words  which  were  fed  from  the  tape  recorder  output 
directly  into  the  analog-to-digital  preprocessor  (see  Figure  2). 
For  example,  a  phonated  /i/  and  a  whispered  /i/  were  digitized 
and  analyzed  together,  and  so  on. 

The  digitization  process  consisted  of  low  pass  filtering  each 
of  the  sample  pairs  at  5000  Hz  with  a  sampling  rate  of  10000  Hz. 
Once  the  sample  pairs  were  digitized,  they  were  stored  on  a  disk 
as  a  sampled  data  file  or  primary  file. 

At  this  point,  the  linear  predictive  coding  analysis  phase 
was  initiated  by  giving  the  computer  the  API  command  which 
instructed  the  simulated  vocoder  to  analyze  and  model  the 
spectral  characteristics  of  the  input  data  (the  digitized  sample 
pair  in  the  sampled  data  file)  using  the  linear  predictive  coding 
method.  In  addition,  the  excitation  of  fundamental  frequency 
status  was  determined  using  a  modified  cepstral  processing 
technique. 

Upon  completion  of  the  LPC  analysis  via  the  API  execution 
the  analysis  data  were  stored  on  a  disk  and  designated  as  the 
secondary  file  or  analysis  file.  From  the  analysis  file,  a 
variety  of  acoustic  parameters  could  be  extracted  (See  Figure  2) 
once  the  appropriate  command  was  given. 
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The  first  data  analysis  output  program  was  the  speech  spectro¬ 
gram  (SGM).  This  program  computed  a  frequency  spectrum  of  the 
speech  samples  at  specified  points  in  time  by  performing  a  Fast 
Fourier  Transform  (FFT)  on  the  coefficients  that  were  determined 
during  the  LPC  analysis.  When  SGM  was  completed,  the  formant 
frequencies,  formant  bandwidths  and  formant  amplitudes  became 
available  in  the  analysis  file. 

Following  the  SGM  program,  a  Formant  Tracking  (FTR)  program 
was  executed  which  essentially  provided  a  trace  of  the  formant 
trajectories  on  the  previously  completed  digital  spectrogram  (see 
Figure  5). 

At  this  point,  the  fundamental  frequency,  formant  frequency, 
formant  amplitude  and  formant  bandwidth  data  were  printed  out  by 
activating  the  Interactive  Editing  (IAE)  mode.  Thus,  for  each  of 
the  speech  samples,  the  fundamental  frequencies,  formant  frequencies, 
formant  bandwidths,  and  formant  amplitudes  were  printed  out  for  8 
frames  or  points  in  time  (duration  of  each  frame=6.4  ms)  from  the 
center  of  each  of  the  vowel  samples. 

From  the  analysis  data  file,  three  additional  graphic  displays 
of  the  phonated  and  whispered  speech  samples  were  produced:  SPL 
(Spectral  Plot),  FPL  (Frequency  Plot)-,.  FDI  (Raw  Spectral  Plot) 
and  VTR  (Vocal  Tract  Plot).  Examples  of  these  displays  may  be 
observed  in  Figures  6-10,  respectively.  The  SPL  display  is  a 
three-dimensional  plot  of  formant  frequency  and  amplitude  as 
functions  of  time.  The  FPL  display  is  essentially  the  same  as 
the  SPL  display  except  that  the  spectrum  of  each  frame  is  dis¬ 
played  separately  allowing  a  more  detailed  observation  of  formant 
frequency  and  amplitude  variations  over  time.  The  FDI  display  is 
a  raw  spectral  display  of  the  harmonics  as  well  as  the  formants 
for  a  selected  segment  of  a  given  sample.  A  smoothing  curve  (SSP) 
may  be  superimposed  on  the  raw  spectrum  to  show  actual  formant 
peaks  and  peak  amplitudes. 

The  VTR  display  shows  the  vocal  tract  configurations  that 
produced  the  various  formant  frequencies  at  specific  points  in 
time. 


V.  RESULTS 


Twenty  (20)  phonated  and  20  whispered  speech  samples  were 
identified  as  meeting  the  target  vowel  productions  for  the  female 
subjects  while  35  phonated  and  35  whispered  samples  were  identi¬ 
fied  as  meeting  the  target  vowel  productions  for  the  male 
subjects.  Thus  40  adult  female  vowels  and  70  adult  male  vowels 


taken  from  connected  speech  provided  the  analysis  data  for  this 
study.  The  mean  values  obtained  from  the  LPC  acoustic  analyses 
are  tabulated  in  Tables  1-9. 

Males 

An  observation  of  the  data  in  Table  1  indicates  that  FI  is 
higher  in  frequency  for  all  five  vowels  when  they  are  whispered 
than  when  they  are  phonated.  For  the  whispered  vowels,  FI  was, 
on  the  average,  147  Hz  higher  than  FI  for  the  phonated  vowels. 

Table  2  indicates  that  the  FI  bandwidths  followed  a  similar 
trend.  The  FI  bandwidths  for  the  whispered  vowels  were,  on  the 
average,  59  Hz  wider  than  the  FI  bandwidths  for  the  phonated  vowels. 

The  FI  amplitudes  followed  a  reverse  trend  as  can  be 
observed  in  Table  3.  The  FI  amplitudes  were,  on  the  average,  9  dB 
greater  in  magnitude  for  the  phonated  vowels  than  the  whispered 
vowels.  However,  Table  3  also  indicates  a  shift  in  energy  toward  F3 
for  the  whispered  vowels  whereas  most  of  the  spectral  energy  was 
concentrated  at  FI  and  F2  for  the  phonated  vowels.  The  F3 
amplitudes  are,  on  the  average,  2  dB  greater  in  magnitude  for  the 
whispered  vowels  than  F3  for  the  phonated  vowels.  It  may  also  be 
observed  that  FI ,  F2  and  F4  amplitudes  were  greater  for  the  phonated 
vowels  for  the  male  speakers.  One  of  the  more  salient  differences 
between  phonated  and  whispered  vowels  then,  was  the  shift  of 
spectral  energy  toward  F3  for  whispered  vowels.  Figures  6,  7  and  10 
provide  graphic  examples  of  this  trend. 

The  same  trends  reported  for  FI  center  frequencies  for  phonated 
and  whispered  vowels  may  be  observed  for  FI,  F3  and  F4.  Formants  2, 
3  and  4  for  the  vowel  /i/,  however,  were  higher  in  frequency  for  the 
phonated  vowels  than  for  the  whispered  vowels. 

No  discernible  pattern  of  difference  can  be  observed  for  F2,  F3 
and  F4  bandwidths,  although  the  F2  bandwidths  were  wider  for  all  of 
the  whispered  vowels  than  for  the  corresponding  phonated  vowels  with 
the  exception  of  the  vowel  /u/  (see  Table  2). 

As  shown  in  Table  3>  all  of  the  formant  amplitudes  were  greater 
for  the  phonated  vowels  except  the  Formant  3  amplitudes.  As  was 
noted  earlier,  the  majority  of  the  spectral  energy  was  concentrated 
at  F3  for  the  whispered  vowels. 

It  can  also  be  observed  in  Table  1  that  the  mean  fundamental 
frequency  (F0)  for  the  male  phonated  vowels  was  126  Hz  while  the 
mean  F0  for  the  whispered  vowels  was  0  Hz.  The  0  values  for  the 


whispered  vowels  verified  that  indeed  the  whispered  vowels  were 
not  voiced  or  phonated  (whisper  is  defined  as  the  production  of 
normally  phonated  sounds  without  voice11).  The  126  Hz  mean  F0  is 
also  very  close  to  the  normative  value  given  in  the  literature 
for  adult  male  habitual  pitch. 

Females 

An  examination  of  Table  4  indicates  that  the  mean  FI  frequen¬ 
cies  for  females  were  on  the  average  167  Hz  higher  for  the  whispered 
vowels  than  for  the  phonated  vowels.  The  same  trend  was  observed 
earlier  for  the  male  subjects. 

The  FI  and  F2  bandwidths  tabulated  in  Table  5  were  wiser  for 
the  whispered  than  for  the  phonated  /i/,  /u/  and  /a/  while  the 
opposite  trend  for  the  vowels  /ae/  and  /  /  is  indicated.  The  FI 
bandwidths  were  on  the  average  57  Hz  wider  for  the  whispered  vowels 
than  for  the  phonated  vowels  while  the  F2  bandwidths  were  on  the 
average  96  Hz  wider. 

As  was  observed  with  the  male  speakers,  F2  center  frequencies 
were  higher  for  all  of  the  whispered  vowels  than  for  the  phonated 
vowels  with  the  exception  of.  the  vowel  /i/  (see  Table  4).  No 
discernible  pattern  of  difference  was  observed  for  F3  and  F4  center 
frequencies,  bandwidths  and  amplitudes. 

The  mean  fundamental  frequency  for  the  female  phonated  vowels 
was  217  Hz  and  the  mean  F0  for  the  whispered  vowels  was  0  (Table 
4).  As  was  pointed  out  for  the  males,  the  0  Hz  F0  for  the  female 
whispered  vowels  verified,  by  definition,  the  absence  of  glottal 
activity  in  female  whispered  vowels. 

The  217  Hz  mean  F0  is  almost  identical  to  the  220  Hz  norma¬ 
tive  habitual  pitch  commonly  reported  in  the  literature  for  young 
adult  females. 


VI.  DISCUSSION  AND  CONCLUSIONS 


A  comparison  of  the  male  and  female  phonated  and  whispered 
vowel  spectra  was  produced  by  the  LPC-10  vocoder  indicated  higher 
FI  center  frequencies  under  the  whispered  condition  than  under 
the  phonated  condition  for  all  of  the  experimental  vowels.  The 
actual  mean  differences  were  147  Hz  and  167  Hz  for  the  males  and 
females,  respectively.  The  identical  finding  was  reported  by 
Kallail  and  Emanuel  (1984).  However,  the  present  study  used  male 
and  female  vowels  in  connected  speech  wherein  the  Kallail  and 
Emanuel  study  used  female  vowels  spoken  only  in  isolation. 
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The  increased  FI  center  frequency  for  whispered  vowels  can 
be  attributed  to  the  greater  vertical  tongue  constriction  used 
when  producing  whispered  vowels.  Figures  7  and  8  provide  graphic 
evidence  of  greater  tongue  height  and  greater  constriction  in  the 
pharyngeal  area  for  whispered  vowels  than  for  phonated  vowels. 

It  is  well  known  that  vowel  FI  center  frequency  is  associated 
with  movements  of  the  tongue  within  the  vertical  plane  of  the 
oral  cavity.  It  is  also  well  known  that  the  frequency  of  FI  is 
raised  by  constriction  of  the  pharynx1^. 

A  comparison  of  the  adult  male  phonated  vowel  formants  with 
those  reported  by  Peterson  and  Barney13  (p  &  B)  and  Fairbanks 
and  Grubbs11*  indicates  very  close  agreement  (see  Table  7).  The 
mean  differences  between  the  Peterson  and  Barney  male  FI ,  F2  and 
F3  frequencies  and  the  LPC-10  male  phonated  FI,  F2  and  F3 
frequencies  were  45  Hz,  116  Hz  and  53  Hz,  respectively. 

Similarly,  Table  8  shows  that  adult  female  phonated  vowel 
formants  compared  quite  favorably  with  the  adult  female  formants 
reported  by  Peterson  and  Barney.  The  differences  between  the  P  &  B 
adult  female  FI  frequencies  and  the  LPC  adult  female  phonated  FI 
frequencies  were  71  Hz,  126  Hz  and  220  Hz,  respectively. 

These  results  would  suggest  that  the  linear  predictive 
coding  fundamental  frequency  and  formant  frequency  extraction 
techniques  are  adequate  when  compared  with  the  data  obtained  from 
conventional  sound  spectrographic  techniques.  It  should  be 
pointed  out  however  that  the  adult  females'  F3  values  as  reported 
by  the  present  study  and  those  reported  by  Peterson  and  Barney 
are  different  enough  to  suggest  some  measurement  disparity.  The 
present  data  did  not  permit  the  specification  of  the  direction  of 
the  disparity. 

In  an  effort  to  assess  the  LPC  vocoder's  ability  to  process 
whispered  speech,  the  LPC  measured  whispered  vowel  formants  as 
produced  by  adult  females  were  compared  to  the  female  whispered 
vowel  formant3  reported  by  Kallail  and  Emanuel  (K  &  E)  which  were 
measured  with  the  sound  spectrograph.  These  data  are  presented 
in  Table  9.  The  mean  differences  between  LPC  measured  whispered 
FI,  F2  and  F3  and  the  spectrographic  whispered  FI,  F2  and  F3  were 
89  Hz,  282  Hz  and  311  Hz,  respectively.  Although  these  results 
compare  favorably,  all  of  the  vowel  F3  values  measured  with  the 
sound  spectrograph  were  higher  than  the  F3  values  that  were 
obtained  with  the  LPC  vocoder.  A  similar  result  was  noticed 
earlier  when  the  Peterson  and  Barney  female  F3  values  were 
compared  with  the  LPC  F3  values.  This  trend  suggests  one  of  the 
two  measurement  techniques  underestimated  or  overestimated  F3 
center  frequency  values.  Additional  research  is  needed  to 
clarify  this  issue. 


In  summary,  the  data  obtained  from  this  study  suggests  the 
following: 

1.  The  LPC-10  vocoder’s  performance  in  processing  whispered 
and  phonated  speech  when  compared  with  conventional  sound  spectro- 
graphic  techniques  measured  up  quite  favorably  for  the  first  two 
formants  in  adult  male  and  female  speakers.  The  LPC  vocoder's 
ability  to  process  the  higher  formants  needs  further  research. 

2.  The  LPC-10  vocoder  processed  voices  with  pitch  extremes 
quite  well.  Its  fundamental  frequency  tracking  precision  compared 
very  closely  with  previously  reported  spectrographic  data  for 
male  and  female  vowels. 

3.  The  most  salient  acoustic  difference  between  phonated 
and  whispered  vowels  in  connected  speech  was  an  increase  in  the 
frequency  of  the  first  three  formants,  the  greatest  increase 
occurring  at  FI  for  males  and  females. 

4.  Whispered  vowels  had  wider  formant  bandwidths  than  pho¬ 
nated  vowels  for  male  and  female  spoken  vowels. 

5.  For  phonated  vowels,  most  of  the  spectral  energy  was 
concentrated  at* the  first  two  formants  for  males  and  females  while 
the  majority  of  the  spectral  energy  shifted  from  the  lower  two 
formants  to  the  F3  area  for  male  and  female  whispered  vowels. 

6.  The  LPC-10  vocoder’s  synthesized  phonated  and  whispered 
vowel  waveforms  and  qualities  and  unprocessed  phonated  and  whis¬ 
pered  vowel  waveforms  were  barely  discernible,  although  the 
differences  between  the  synthesized  and  unprocessed  whispered 
vowel  waveforms  and  qualities  were  slightly  more  noticeable  than 
the  differences  between  the  unprocessed  and  synthesized  phonated 
vowel  waveforms  and  qualities  (see  Figure  11). 


VII.  RECOMMENDATIONS 


Although  the  results  of  the  LPC-10  vocoder’s  processing  of 
phonated  and  whispered  speech  and  the  data  obtained  from  previous 
spectrographic  analysis  compared  rather  well,  there  is  still  a 
need  for  further  research  which  systematically  tests  the  LPC 
vocoder’s  ability  to  process  more  intermediate  qualities  such  as 
nasality,  harshness,  stridency,  etc. 

It  would  also  be  exremely  important  to  determine  the 
influence  of  dialectal  differences  (regional  and  foreign)  on  the 


processing  ability  of  the  LPC-10  vocoder.  There  is  a  dearth  of 
research  relative  to  this  issue. 

Finally,  there  is  a  need  for  research  to  study  the  influenc 
of  various  suprasegmental  or  prosodic  speech  features  such  as 
intonation,  rate  and  duration  on  LPC  vocoding  at  different  bit 
rates,  particularly  at  2400  bits/second  and  lower. 


PHONATED 


1 . 

The 

syllable 

heed 
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syllable 

who ' 

d  is  the  word. 
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WHISPERED 
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is  the  word. 
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who' 

d  is  the  word. 
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The 
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is  the  word. 
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The 

syllable 

hud 

is  the  word. 

Figure  1.  The  stimulus  material  used  by  each  of  the  11  subjects 
to  produce  the  .experimental  vowel  samples.  Each  of  the  vowels 
was  produced  with  the  same  consonantal  environment  while  the  vowel 
embedded  words  were  contained  in  the  same  carrier  sentences. 


Figure  4.  An  oscillographic  display  of  an  adult  male's  pro¬ 
duction  of  a  phonated  ( P ]  and  whispered  [W]  vowel  /a/  prior 
to  LPC  processing.  Note  that  the  vowel  is  embedded  in  the 
word  "Hud". 
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Figure  5.  A  digital  spectrogram  [SGM]  with  formant  tracking 
[FTR]  for  an  adult  male  speaker's  production  of  a  phonated 
(Pi  and  whispered  [W]  vowel  /*/  embedded  in  the  word  "Hud". 
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Figure  7.  Frequency  plot  (FPL)  of  8  frames  of  an  adult  male 
speaker's  spectral  tilt  variations  during  the  production  of 
(a)  phonated  /A/  and  /b/  whispered  /a/. 


Figure  10.  An  FFT  raw  spectrum  (FDI)  and  smooth  spectrum 
(SSP)  of  an  adult  male's  (a)  phonated  and  (b)  whispered 
production  of  the  vowel  A/. 


Figure  11.  An  oscillographic  display  of  an  adult  male's 
(a)  natural  production  of  the  phonated  and  whispered  vowel 
/a/  and  (b)  the  synthesized  waveform  of  the  adult  male's 
production  of  the  phonated  and  whispered  vowel  /a/. 


Table  1.  Average  formant  frequency  and  fundamental  frequency 
values  for  male  phonated  and  whispered  productions  of  the 
experimental  vowels.  Standard  deviations  are  given  in  paren 
theses  below  each  of  the  mean  values.  Negative  differences 
indicate  that  the  whispered  values  are  larger  than  the  pho¬ 
nated  values. 
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Table  2.  Average  formant  bandwidth  values  for  male  phonated 
and  whispered  productions  of  the  experimental  vowels.  Standard 
deviations  are  given  in  parentheses  below  each  of  the  mean 
values.  Negative  differences  indicate  that  the  whispered  values 
are  greater  than  the  phonated  values. 
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Table  3.  Average  formant  amplitude  values  for  male  phonated  and 
whispered  productions  of  the  experimental  vowels.  Standard  devi 
ations  are  given  in  parentheses  below  each  of  the  mean  values. 
Negative  differences  indicate  that  the  whispered  values  are 
greater  than  the  phonated  values. 
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Table  6.  Average  formant  amplitude  values  for  female  pho- 
nated  and  whispered  productions  of  the  experimental  vowels 
Standard  deviations  are  given  in  parentheses  below  each  of 
the  mean  values.  Negative  differences  indicate  that  the 
whispered  values  are  greater  than  the  phonated  values. 
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Table  7.  The  mean  formant  frequencies  and  fundamental  frequen 
cies  for  adult  male  phonated  vowels  as  determined  by  the 
LPC  vocoder  and  the  mean  formant  frequencies  and  fundamental 
frequencies  for  adult  male  vowels  reported  by  Peterson  and 
Barney  (P  i  El  as  measured  with  a  sound  spectrograph. 
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Table  9.  Mean  formant  frequencies  for  adult  female  whispered 
vowels  reported  by  Kallail  and  Emanuel  (K  &  E)  and  the  mean 
formant  frequencies  for  adult  female  whispered  vowels  as 
measured  with  the  LPC  vocoder.  Negative  differences  indicate 
that  the  LPC  values  are  greater  than  the  K  4  E  values. 
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